A spectral clustering method for microarray data

نویسندگان

  • David Tritchler
  • Shafagh Fallah
  • Joseph Beyene
چکیده

This paper considers a clustering method motivated by a multivariate analysis of variance model and computationally based on eigenanalysis (thus the term “spectral” in the title). Our focus is on large problems, and we present the method in the context of clustering genes using microarray expression data. We provide an e5cient computational algorithm and discuss its properties and interpretation in statistical and geometric terms. Leukemia and Melanoma data sets are analyzed to demonstrate the use of the method, and simulations are carried out to compare our method with two other clustering algorithms. We extend the method to enable supervision by either gene or array characteristics. c © 2004 Elsevier B.V. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

متن کامل

Segmentation of cDNA Microarray Images using Parallel Spectral Clustering

Microarray Image Microarray technology generates large amounts of expression level of genes to be analyzed simultaneously. This analysis implies microarray image segmentation to extract the quantitative information from spots. Spectral clustering is one of the most relevant unsupervised methods able to gather data without a priori information on shapes or locality. We propose and test on micro...

متن کامل

Spectral Clustering Gene Ontology Terms to Group Genes by Function

With the invention of biotechnological high throughput methods like DNA microarrays, biologists are capable of producing huge amounts of data. During the analysis of such data the need for a grouping of the genes according to their biological function arises. In this paper, we propose a method that provides such a grouping. As functional information, we use Gene Ontology terms. Our method clust...

متن کامل

Hybrid Algorithm for Clustering of Microarray Data

Clustering is a crucial step in the analysis of gene expression data. Its goal is to identify the natural clusters and provide a reliable estimate of the number of distinct clusters in a given data set. In this paper we propose new hybrid algorithm for clustering of microarray data based on spectral clustering and k-means. Our algorithm consist of four steps, including preprocessing or filterin...

متن کامل

به کارگیری روش‌های خوشه‌بندی در ریزآرایه DNA

Background: Microarray DNA technology has paved the way for investigators to expressed thousands of genes in a short time. Analysis of this big amount of raw data includes normalization, clustering and classification. The present study surveys the application of clustering technique in microarray DNA analysis. Materials and methods: We analyzed data of Van’t Veer et al study dealing with BRCA1...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 49  شماره 

صفحات  -

تاریخ انتشار 2005